






Reliability and Validity of the
Aggregate Method of
Determining Number of
Cigarettes Smoked Per Day

Peter W. Gariti, Ph.D., Arthur I. Alterman, Ph.D. 

Ronald N. Ehrman, Ph.D., Helen M. Pettinati, Ph.D.


The authors evaluated the reliability of two pretreatment
assessments (screening and intake) of cigarettes smoked per day
(CPD) by the commonly used aggregate method. The validity
of the aggregate method was also determined by comparison
with results of the timeline followback (TLFB) method for the
identical periods. The study participants were 49 outpatients
undergoing nicotine patch treatment. The reliability of the two
aggregate method evaluations of CPD was quite high by Pearson
product-moment correlation (r) and good when based on
the intraclass correlation. Correspondence between the CPD
assessments based on the aggregate and TLFB methods for the
two time-points ranged from fair (screening) to good (intake).
Overall, the study findings indicate that the aggregate method
provides reasonably consistent data. (Am J Addict 1998; 7:283-287)





Accurate measurement of smoking is
necessary for evaluating the effectiveness
of cessation treatment. Obtaining an accurate
and reliable estimate of pretreatment
(as well as during- and posttreatment) smoking
patterns is an important factor not only
in measuring outcome over time but also in
accurately classifying subjects into light, medium,
and heavy smokers, determining recommended
patch dosage, monitoring patch
adherence, and comparing nicotine/cotinine
replacement values with baseline
level of smoking.


Given that biological markers of choice
(plasma, urine, or saliva cotinine) used in
most clinical trials are neither economical
nor adequate reflections of the amount of
smoking over extended periods of time,
assessment of the amount of smoking is largely
dependent on the respondent’s self-report
of cigarette use. An obvious question that
can be raised is how we know that what is
reported during screenings or just before
the start of treatment is an accurate reflection
of actual smoking behavior. Arguably, a
potential patient may deliberately over- or


Received December 17, 1997; accepted April 3, 1998. From The Philadelphia Veterans Affairs Medical Center/
University of Pennsylvania School of Medicine, Philadelphia, PA. Address correspondence to Dr. Gariti, Treatment
Research Center, 3900 Chestnut Street, Philadelphia, PA 19104.


THE AMERICAN JOURNAL ON ADDICTIONS 








Source: https://www.industrydocuments.ucsf.edu/docs/kqyj0001 



Measuring Cigarette Smoking 


underestimate the number of cigarettes
smoked because of perceived demand characteristics
related to treatment entry. Alternatively,
a respondent may simply provide a
gross estimate rounded off to the nearest
pack or half-pack.¹ Hence, the reported
number of cigarettes smoked initially may
not necessarily reflect the actual amount being
smoked at the beginning of treatment.

The method for determining baseline
cigarette usage in most smoking-cessation
patch studies is the “aggregate” method.
Using this method, potential recruits are
typically asked the number of cigarettes currently
being smoked, which elicits a summary
estimate of the number of cigarettes
smoked per day (CPD). The apparent assumption
made in most studies is that there
is little variability in the level of smoking before
the start of actual treatment. It frequently
has been reported that except for
attempts at quitting or other intervening circumstances,
such as a hospitalization, there
is minimal fluctuation in smoking level after
the first 5 years of smoking initiation.² However,
the observation that smokers tend to
reduce their habit during the work week in
response to an increase in the number of
worksites that are smoke-free contradicts
this assumption. Therefore, a “quantity-frequency”
(QF) method that evaluates the
number of days smoked as well as the number
of cigarettes smoked per given day over
an extended time-frame (i.e., the past month
or longer) may provide a more accurate profile
of CPD, particularly in persons with a
variable pattern of use.

Sobell and Sobell³ have developed a
highly reliable variation of the quantity-frequency
method for obtaining patient self-report
of alcohol use retrospectively over
extended time periods. Timeline followback
(TLFB) reports have consistently demonstrated
high test-retest reliability across multiple
population types (normal drinkers and
alcoholic subjects) and have been reasonably
congruent with collateral supportive
data^ and single-sample biological testing,
namely liver-function testing (SGOT/SGPT).^
In a recent application of TLFB to the assessment
of substance abuse, Ehrman and
Robbins⁵ reported that TLFB estimates of cocaine
and heroin use over a 6-month period
correlated highly with program-based weekly
qualitative urine specimens in a sample of
methadone-maintenance patients.

Given the foregoing considerations, two
basic questions were evaluated in this study:
1) the extent to which two pretreatment aggregate
assessments of CPD collected 1
month apart correlate with each other (reliability);
and 2) the extent to which CPD
data obtained using the two aggregate measures
correspond with a subsequent TLFB
assessment of CPD for the same two pretreatment
time-points (validity).

METHODS 


Research Participants 


Participants consisted of 49 patients
(30 female, 19 male) recruited and randomized
into a smoking-cessation study at a
university-based substance-abuse outpatient
setting. The participants, chosen by telephone
screening, were physically healthy
and mentally stable men and nonpregnant
women between the ages of 18 and 65, who
reported smoking at least one pack of cigarettes
daily for at least the past month, met
DSM-IV criteria for nicotine dependence on
the basis of a semistructured diagnostic interview,
and reported at least one previous
failed attempt at smoking cessation. Prospective
participants were excluded if they
had any medical condition that would preclude
the use of the patch (e.g., unstable cardiovascular
disease, allergies to the patch,
peptic ulcer), had serious cognitive disorders,
were currently psychotic or expressed
current homicidal or suicidal ideation, met
DSM-IV criteria for current non-nicotine substance
abuse/dependence within the past 6
months, or were currently using cocaine or
nonprescribed amphetamine.




VOLUME 7 • NUMBER 4 • FALL 1998 





Gariti et al.


Participants were recruited from a number
of sources, including university campus
notices, local newspaper advertisements,
and word-of-mouth. Written informed consent
was obtained after subjects received a
complete study description and passed a
consent form quiz.

The average subject was 43 ± 9 years of
age; 61% were women, and 65% were white
(33% African-American). On average, participants
had completed 15 ± 3 years of school;
33% were currently married; and 80% were
currently employed. They had smoked for an
average of 25 ± 9 years and had made 7 ± 11
previous attempts to quit smoking.

Procedures 


Because recruitment notices did not
specify study requirements, research candidates
were unaware of inclusion/exclusion
criteria regarding smoking behavior before
they were screened. No attempt was made
to have subjects alter cigarette use before
the actual start of patch treatment, nor were
they told that their responses would be
cross-checked for consistency.

Research technicians questioned subjects
about their cigarette use on three
separate occasions: 1) during a technician-administered
structured telephone screening
that used the aggregate method; 2) at intake
approximately 1 month later by a technician
as well as a psychiatrist separately confirming
responses to a standardized smoking-history
questionnaire using the aggregate method;
and 3) at a technician-administered TLFB interview
that took place at initiation of patch
treatment approximately 1 month after intake.
Fifteen of the 49 subjects were evaluated
by different technicians at the three time-points,
whereas 34 of 49 were evaluated each
time by the same technician.

About 25% of the subjects at the telephone
screening (but not at intake) specified
a range of CPD when the aggregate
method was used. In these cases, the midpoint
value was selected a priori. The TLFB
procedure used an adapted script based on
Sobell and Sobell’s instructions for completing
a timeline drinking calendar.³ Participants
were asked to provide their best estimate
of daily cigarette usage across the
preceding 6-month period. Anchoring-points,
such as weekdays, weekends, holidays,
paydays, days off, and the recall of major
life events or personal events were
elicited to aid the recall of past smoking
for the TLFB. TLFB responses were then
averaged, using the dates of the initial
telephone screen, intake, and start of treatment
as the anchoring-points for determining
the average number of cigarettes
smoked 1 month before each of the aforementioned
anchoring-points.
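
The midpoint rule and the windowed averaging described above can be sketched in Python. This is only an illustration under stated assumptions: the function names (`range_midpoint`, `window_mean_cpd`), the 30-day window standing in for "1 month," the handling of missing days, and the sample calendar and dates are all hypothetical and are not taken from the study's procedures.

```python
from datetime import date, timedelta
from statistics import fmean


def range_midpoint(low, high):
    """Midpoint used when a respondent reports a range of CPD (e.g., 20 to 30)."""
    return (low + high) / 2


def window_mean_cpd(daily_counts, anchor, days=30):
    """Mean cigarettes per day over the `days` days immediately before `anchor`.

    `daily_counts` maps calendar dates to the cigarettes recalled for that day.
    Days with no TLFB entry are skipped rather than counted as zero (assumption).
    """
    window = (anchor - timedelta(days=k) for k in range(1, days + 1))
    values = [daily_counts[d] for d in window if d in daily_counts]
    if not values:
        raise ValueError("no TLFB entries fall inside the window")
    return fmean(values)


# Hypothetical TLFB calendar: 25 cigarettes on weekdays, 35 on weekend days.
start = date(1997, 1, 1)
calendar = {}
for i in range(180):
    day = start + timedelta(days=i)
    calendar[day] = 35 if day.weekday() >= 5 else 25

# Hypothetical anchoring dates roughly 1 month apart.
anchors = [
    ("screening", date(1997, 4, 1)),
    ("intake", date(1997, 5, 1)),
    ("start of treatment", date(1997, 6, 1)),
]
for label, anchor in anchors:
    print(label, round(window_mean_cpd(calendar, anchor), 2))
```

Keeping the window length as a parameter makes it easy to check how sensitive the averages are to the definition of "1 month."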

RESULTS 


The degree of association between the two
estimates of CPD using the aggregate and
TLFB methods for each of these time periods
was evaluated using both the Pearson
product-moment correlation (r) and the intraclass
correlation (ICC) formula developed
by Lin.⁶ Although r provides information on
the degree of ordinal relationship, it provides
no information on absolute agreement.
The ICC takes absolute values into account
and provides a measure of extent of
exact agreement. The ICC is clearly a more
accurate indicator of degree of relationship.^
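
The contrast between the two statistics can be illustrated with a short Python sketch. It assumes that the ICC referred to here is the concordance correlation coefficient from the cited paper by Lin, computed with 1/n variances and covariance; the function names and the data are hypothetical and do not come from the study.

```python
from math import sqrt
from statistics import fmean


def pearson_r(x, y):
    """Pearson product-moment correlation between paired measurements."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def concordance_cc(x, y):
    """Concordance correlation coefficient (assumed form of the ICC used here).

    Unlike Pearson r, the denominator penalizes a difference in means,
    so a constant offset between the two methods lowers the coefficient.
    """
    n = len(x)
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    return 2 * sxy / (sxx + syy + (mx - my) ** 2)


# Hypothetical data: the second method reads 10 CPD higher for every subject.
method_a = [20, 25, 30, 40]
method_b = [value + 10 for value in method_a]

print("Pearson r:", round(pearson_r(method_a, method_b), 3))
print("Concordance:", round(concordance_cc(method_a, method_b), 3))
```

In this made-up example the ordering of subjects is preserved perfectly, so r is at its maximum, while the concordance coefficient falls to about 0.52 because the two sets of values never agree exactly.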

Reliability of the Aggregate Method 


The CPD reported at screening was compared
with that obtained at intake. The Pearson r
was 0.92 (P<0.0005), and the ICC was 0.67.
According to Cicchetti,⁷ an ICC between 0.60 and
0.74 is indicative of good agreement. The
mean CPD at these two time-points was
28.42 ± 10.3 and 27.59 ± 10.2, respectively
(paired t₄₈ = 1.43; P = 0.12).
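
The interpretive bands can be written as a small lookup. Only the "good" band (0.60 to 0.74) is stated in the text; the remaining cut points below follow Cicchetti's guidelines as they are commonly summarized and should be read as an assumption, as should the function name `cicchetti_label`.

```python
def cicchetti_label(icc):
    """Verbal label for an agreement coefficient.

    Only the 0.60-0.74 "good" band is given in the article; the other
    cut points are assumed from the commonly cited guidelines.
    """
    if icc < 0.40:
        return "poor"
    if icc < 0.60:
        return "fair"
    if icc < 0.75:
        return "good"
    return "excellent"


# ICC values reported in this article.
for value in (0.67, 0.42, 0.60):
    print(value, cicchetti_label(value))
```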




Validity of the Aggregate Method:
Comparison With TLFB


Screening. The relationship between the
aggregate and TLFB assessments of CPD at
screening was first compared. The Pearson
r was 0.81 (P<0.0005), and the ICC was 0.42.
This latter value should be interpreted as an
indication of, at most, a fair degree of correspondence
between the two measurement
approaches.⁷ The mean for the aggregate
method was 28.42 ± 10.3, and the mean for the
TLFB method was 27.01 ± 10.2; these did
not differ significantly (paired t₄₈ = 1.57;
P = 0.12).

Intake. The Pearson r comparing aggregate
and TLFB CPDs was 0.85 (P<0.0005),
and the ICC was 0.60. The value of 0.60
for the ICC suggests good correspondence
between the two methods of assessment.
The means for the two methods were
27.59 ± 10.2 and 26.86 ± 9.8, respectively.
The paired-sample t-test showed no significant
differences (t₄₈ = 0.94; P = 0.35).

The findings described above were essentially
the same for those cases in which
the same interviewer performed the various
assessments and those in which different
interviewers performed these assessments.

DISCUSSION 


The study findings indicate a reasonably
high level of within-subject consistency between
assessments at different time-points
in reporting CPD by use of the aggregate
method. Comparison of the findings for the
aggregate method with the more precise and
presumably more accurate TLFB method revealed
a moderate or high degree of association,
depending on the measure of association
used.

The relationship between the aggregate
and TLFB methods was determined to be
high when absolute level was not taken into
account (Pearson product-moment correlation
[r]), but was only fair-to-good when the
absolute level was considered (intraclass
correlation [ICC]), that is, when exact agreement
was required. Thus, the evidence suggests
a good degree of validity for the aggregate
method, but also points to some limits
in its validity, given that correspondence between
the two assessment approaches was
not excellent. These data reinforce the conclusion
that although the aggregate method
may provide a satisfactory rough approximation
of amount of smoking, especially at
the level of group analysis, its results are not
as satisfactory for characterizing absolute
changes in individual smoking from time to
time.

More detailed examination of the TLFB
data revealed that smoking was significantly
elevated on Saturdays, as compared with
weekdays. At the same time, smoking was
not especially elevated on Sundays. The variation
between Saturday and weekday smoking
should be further evaluated because it
suggests that patients may be more vulnerable
to relapse at that time. This is information
that cannot be extracted from the more
global aggregate method.
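
A day-of-week breakdown of this kind is straightforward to compute from daily TLFB entries. The following Python sketch is a hypothetical illustration, not the analysis used in the study; the function name `mean_cpd_by_weekday` and the sample data are assumptions.

```python
from collections import defaultdict
from datetime import date, timedelta
from statistics import fmean

DAY_NAMES = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]


def mean_cpd_by_weekday(daily_counts):
    """Average the recalled cigarettes per day separately for each weekday."""
    buckets = defaultdict(list)
    for day, count in daily_counts.items():
        buckets[DAY_NAMES[day.weekday()]].append(count)
    return {name: fmean(buckets[name]) for name in DAY_NAMES if name in buckets}


# Hypothetical 4-week TLFB record: 32 cigarettes on Saturdays, 26 otherwise.
first_day = date(1997, 3, 3)
record = {}
for i in range(28):
    day = first_day + timedelta(days=i)
    record[day] = 32 if day.weekday() == 5 else 26

for name, value in mean_cpd_by_weekday(record).items():
    print(name, value)
```

The aggregate method yields a single number per respondent, so it cannot support a breakdown like this.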

Overall, the study findings indicate that
the aggregate method provides reasonably
consistent estimates of CPD. It may be the
method of choice when time and cost considerations
are important because of its relative
brevity, given that the TLFB takes 5 to
20 minutes to administer. By contrast, TLFB
may offer more detailed and presumably
more accurate data on CPD time course and
may be preferable when its greater cost and
inconvenience are not critical considerations.

An important albeit secondary concern
of this research was whether there was any
indication of decreases in patients’ smoking in
the week immediately before treatment. Our
analyses of the relationship between TLFB
at screening and 1 week before the start of
treatment indicated little variation (< 1 CPD)
in self-reported CPD as subjects approached
their “QUIT DATE.”

It should be emphasized that both 







methods, the aggregate and TLFB, rely on
patients’ self-report. Although the generally
satisfactory degree of correspondence between
the two methods suggests that patients
report CPD relatively accurately, the
ultimate validity of these self-report data
awaits the development of more adequate
objective assessment methods.

Another important possible limitation
of the findings is that the data were based
on individuals who smoked at least 20 cigarettes
daily. The same results may not be
obtained for more moderate smokers.

This study was supported by Grant
#DA 10070 from the National Institute on
Drug Abuse and by an NIDA Center Grant.
The authors thank Gary Luck for assistance
in analysis of the data.


References 


1. Klesges RC, Debon M, Ray JW: Are self-reports of
smoking rate biased? Evidence from the Second
National Health and Nutrition Examination Survey. J
Clin Epidemiol 1995; 10:1225-1233

2. U.S. Department of Health and Human Services:
The Health Consequences of Smoking: Nicotine
Addiction: A Report of the Surgeon General.
Washington, DC, U.S. Government Printing Office, 1988

3. Sobell LC, Sobell MB: Timeline follow-back: a technique
for assessing self-reported alcohol consumption,
in Measuring Alcohol Consumption: Psychosocial
and Biological Methods. Edited by Litten RZ,
Allen J. New Jersey, Humana Press, 1991, pp 2-dO

4. Sobell LC, Sobell MB, Leo GL, et al: Reliability of a
timeline method: assessing normal drinkers’ report
of recent drinking and a comparative evaluation
across several populations. Br J Addict 1988;
83:393-402

5. Ehrman RN, Robbins SJ: Reliability and validity of
six-month timeline reports of cocaine and heroin
use in a methadone population. J Consult Clin
Psychol 1994; 62:1-8

6. I-Kuei Lin L: A concordance correlation coefficient
to evaluate reproducibility. Biometrics 1989;
45:255-268

7. Cicchetti D: Guidelines, criteria, and rules for evaluating
normed and standardized assessment instruments.
Psychol Assess 1994; 6:284-290







